Data Validation


Scenarios Engineering driven Autonomous Transportation in Open-Pit Mines

Teng, Siyu, Li, Xuan, Li, Yuchen, Li, Lingxi, Ai, Yunfeng, Chen, Long

arXiv.org Artificial Intelligence

One critical bottleneck impeding the development and deployment of autonomous transportation in open-pit mines is guaranteeing robustness and trustworthiness in prohibitively extreme scenarios. In this research, a novel scenarios engineering (SE) methodology is proposed for autonomous mining trucks in open-pit mines. SE increases the trustworthiness and robustness of autonomous trucks through four key components: Scenario Feature Extractor, Intelligence & Index (I&I), Calibration & Certification (C&C), and Verification & Validation (V&V). The Scenario Feature Extractor is a comprehensive pipeline that captures complex interactions and latent dependencies in complex mining scenarios. I&I effectively enhances the quality of the training dataset, thereby establishing a solid foundation for autonomous transportation in mining areas. C&C is grounded in the intrinsic regulation, capabilities, and contributions of the intelligent systems employed in autonomous transportation, aligning them with traffic participants in the real world and ensuring their performance through certification. The V&V process ensures that the autonomous transportation system is correctly implemented, while validation focuses on evaluating the ability of the well-trained model to operate efficiently in the complex and dynamic conditions of open-pit mines. This methodology addresses the unique challenges of autonomous transportation in open-pit mining, promoting productivity, safety, and performance in mining operations.


Curriculum Learning and Imitation Learning for Model-free Control on Financial Time-series

Koh, Woosung, Choi, Insu, Jang, Yuntae, Kang, Gimin, Kim, Woo Chang

arXiv.org Artificial Intelligence

Curriculum learning and imitation learning have been leveraged extensively in the robotics domain. However, minimal research has been done on applying these ideas to control tasks over highly stochastic time-series data. Here, we theoretically and empirically explore these approaches in a representative control task over complex time-series data. We implement the fundamental ideas of curriculum learning via data augmentation, while imitation learning is implemented via policy distillation from an oracle. Our findings reveal that curriculum learning should be considered a novel direction for improving control-task performance over complex time series. Our extensive out-of-sample experiments across random seeds, together with ablation studies, are highly encouraging for curriculum learning in time-series control. These findings are especially encouraging given that we tune all overlapping hyperparameters on the baseline, giving the baseline an advantage. On the other hand, we find that imitation learning should be used with caution.
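The core idea of curriculum learning is to order training experience from easy to hard. As a minimal hedged illustration (not the authors' implementation; the window size and the volatility-based difficulty proxy are illustrative assumptions), one simple way to build such a curriculum over a time series is to present low-volatility segments before high-volatility ones:

```python
import statistics

def curriculum_windows(series, window=20):
    """Split a time series into fixed-size windows and order them
    easy-to-hard, proxying 'difficulty' by windowed volatility.
    The window size and difficulty proxy are illustrative choices."""
    windows = [series[i:i + window]
               for i in range(0, len(series) - window + 1, window)]
    # Low-volatility (easier) windows come first in the curriculum.
    return sorted(windows, key=statistics.stdev)
```

A training loop would then feed these windows to the agent in order, optionally re-sorting as the policy improves.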


YOLOv5: Violation Detection on the Roadside of Toll Roads

#artificialintelligence

Based on Indonesia's Government Regulation No. 15 of 2005, Article 1, Paragraph (2), toll roads are public roads that are part of the road network system and national roads whose users are required to pay tolls. Toll roads can also be referred to as expressways. However, despite the many advantages of using toll roads as transportation routes, many accidents still occur on them. The high number of accidents on toll roads is mostly caused by human negligence. An expert researcher at the Center for Transportation and Logistics Studies (PUSTRAL) at UGM cited several factors causing accidents on toll roads, including driver negligence, vehicle condition, the environment and road conditions, and weather.


Data validation in Python: a look into Pandera and Great Expectations

#artificialintelligence

Liam studied for an MSci in Physics at University College London, which included modules on Statistical Data Analysis, High Performance Computing, Practical Physics and Computing. This led to a dissertation exploring the use of machine learning techniques for analysing LHC particle collision data. Before joining endjin, Liam had a keen interest in data science and engineering, and completed a number of related internships. Since joining endjin, however, he has developed a much broader set of interests, including DevOps and more general software engineering. He is currently exploring those interests and finding his feet in the tech space.


Data Validation and Data Verification – From Dictionary to Machine Learning - KDnuggets

#artificialintelligence

Quite often, we use data verification and data validation interchangeably when we talk about data quality. However, these two terms are distinct. Table 1 explains the dictionary meanings of the words verification and validation with a few examples. To summarize, verification is about truth and accuracy, while validation is about supporting the strength of a point of view or the correctness of a claim. Validation checks the correctness of a methodology, while verification checks the accuracy of the results. Now that we understand the literal meaning of the two words, let's explore the difference between "data verification" and "data validation".
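To make the distinction concrete, here is a minimal hypothetical sketch (the field names, ranges, and trusted source are illustrative assumptions, not from the article): data validation checks that incoming records conform to an expected schema before use, while data verification checks stored values against a trusted reference for accuracy.

```python
def validate_record(record):
    """Data validation: check that a record conforms to the expected
    schema and value ranges before it is used (hypothetical schema)."""
    errors = []
    if not isinstance(record.get("age"), int):
        errors.append("age must be an integer")
    elif not 0 <= record["age"] <= 130:
        errors.append("age out of range")
    if record.get("email", "").count("@") != 1:
        errors.append("email must contain exactly one '@'")
    return errors

def verify_record(record, trusted_source):
    """Data verification: check that stored values match a trusted
    reference, i.e. that the data is accurate, and report mismatches."""
    return {k: v for k, v in record.items()
            if trusted_source.get(k) != v}
```

Note that a record can pass validation (well-formed, in range) and still fail verification (simply wrong), which is exactly the distinction the article draws.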


Comparing Shape-Constrained Regression Algorithms for Data Validation

Bachinger, Florian, Kronberger, Gabriel

arXiv.org Artificial Intelligence

Industrial and scientific applications handle large volumes of data that render manual validation by humans infeasible. Therefore, we require automated data validation approaches that can incorporate the prior knowledge of domain experts to produce dependable, trustworthy assessments of data quality. Prior knowledge is often available as rules that describe interactions of inputs with regard to the target, e.g., the target must be monotonically decreasing and convex over increasing input values. Domain experts are able to validate multiple such interactions at a glance. However, existing rule-based data validation approaches are unable to consider these constraints. In this work, we compare different shape-constrained regression algorithms for the purpose of data validation, based on their classification accuracy and runtime performance.
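The paper fits regression models under such shape constraints; as a much simpler hedged illustration of the constraint itself (not the paper's method; the function name and tolerance are assumptions), the following checks raw data points against an expected monotonically-decreasing relationship:

```python
def monotone_decreasing_violations(xs, ys, tol=1e-9):
    """Return the x-pairs where the target *increases* with the input,
    violating an expected monotonically-decreasing shape constraint.
    An empty list means the data is consistent with the constraint."""
    points = sorted(zip(xs, ys))
    return [(points[i][0], points[i + 1][0])
            for i in range(len(points) - 1)
            if points[i + 1][1] > points[i][1] + tol]
```

This naive point-wise check is brittle under measurement noise, which is one motivation for instead fitting a smooth model constrained to the expected shape, as the paper does.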


7 Considerations Before Pushing Machine Learning Models to Production

#artificialintelligence

Being part of a company that values scalability, as a data scientist I see daily the challenges that come with putting AI-based solutions into production. These challenges are numerous and cover a variety of aspects: modeling and system design, data engineering, resource management, SLAs, etc. I don't claim mastery of any of those fields. I do, however, know that applying some software engineering principles and using the right tools helped me a lot in making my work reproducible and ready for production. In this article, I'll share with you 7 of the considerations I keep in mind before productionizing my models.


Serving a Machine Learning Model with FastAPI and Streamlit

#artificialintelligence

Machine learning is a hot topic at present. With technology companies moving in the direction of artificial intelligence and machine learning to cash in early, the field has grown tremendously. Many of these companies create their own machine learning solutions and sell them to others using a subscription-based model. Since the majority of machine learning models are developed in Python, the web frameworks that serve them are usually Python-based as well. For a long time, Flask, a micro-framework, was the go-to choice.


Data Validation in Machine Learning is Imperative, Not Optional - KDnuggets

#artificialintelligence

Operationalizing a machine learning (ML) model in production requires much more than just creating and validating models as in academia or research. The ML application in production can be a pipeline with multiple components running consecutively, as shown in Fig 1. Before we reach model training in the pipeline, various components such as data ingestion, data versioning, data validation, and data pre-processing need to be executed. Data validation means checking the accuracy and quality of source data before training a new model version. It ensures that anomalies that are infrequent or manifested only in incremental data are not silently ignored.
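As a hedged sketch of what such a pipeline stage might do (the statistic, threshold, and function names are illustrative assumptions, not from the article), a validation gate can compare summary statistics of incoming data against those recorded at the previous training run and block training when drift is excessive:

```python
def validation_gate(stats_baseline, stats_new, max_drift=0.2):
    """Minimal data-validation gate (hypothetical thresholds): compare
    per-column mean statistics of incoming data against the previous
    training run and report columns whose drift exceeds the tolerance."""
    report = {}
    for col, base in stats_baseline.items():
        new = stats_new.get(col)
        if new is None:
            report[col] = "missing column"
        elif base != 0 and abs(new - base) / abs(base) > max_drift:
            report[col] = f"mean drifted {base} -> {new}"
    return report  # empty report -> safe to proceed to model training
```

A pipeline would run this after data ingestion and either halt or alert on a non-empty report, so that anomalies in incremental data are surfaced rather than silently ignored.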


Marketing Data Scientist (copy)

#artificialintelligence

As an industry leader and Software-as-a-Service provider our mission at 8x8, Inc. [NYSE: EGHT] is to transform the future of business communications. The 8x8 Open Communications Platform (TM) uniquely brings …